(57) Abstract: This application relates to libraries of ABx toxin mutants, in which a peptide insert is introduced into the protease-
sensitive loop of the A-chain sequence to alter the type of cells to which toxic species are delivered. Said libraries are used in the
development of therapeutics targeted against specific cell types.

LIBRARY FROM TOXIN MUTANTS, AND METHODS OF USING SAME

Background of the Invention

This application relates to libraries of toxin mutants, and to methods of using same in
the development of therapeutics targeted against specific cell types.

Plant and bacterial toxins have a structural organization with two or more polypeptide
domains or subunits responsible for distinct functions, referred to as A and B. The toxins may
be referred to as ABx toxins where x represents the number of identical or homologous B
subunits in the toxin. This family of framework-related toxins includes examples such as
Shiga and Shiga-like toxins, the E. coli heat-labile enterotoxins, cholera toxin, diphtheria
toxin, pertussis toxin, Pseudomonas aeruginosa exotoxin A (Olsnes, S. and Sandvik, K.
(1988) in Immunotoxins pp. 39-73, Kluwer Academic, Boston; Sandvik, K., Dubinina, E.,
Garred, O., et al. (1992) Biochem. Soc. Trans. 20:724) as well as plant toxins such as ricin
and abrin. In some cases the toxins are heteromeric, in that the B chains are actually separate
entities that connect to the toxic A chain via non-covalent bonding. In other cases, the toxin
is monomeric, since the B chain is part of the same protein when the toxin is produced in
nature. In many cases, the A chain has been characterized as having two domains, an A1
domain and an A2 domain.

Based on their ability to block protein synthesis, proteins such as Shiga and Shiga-like
toxins as well as ricin, abrin, gelonin, crotin, pokeweed antiviral protein, saporin, momordin,
modeccin, sarcin, diphtheria toxin and exotoxin A have been referred to as
ribosome-inactivating proteins (RIP). The potency of RIPs is exceedingly high; one molecule
of diphtheria toxin A chain (Yamaizumi, et al. (1978) Cell 15:245-250) or ricin A chain
(Eiklid, et al. (1980) Exp. Cell Res. 126:321-326) has been shown to be sufficient to kill a
eukaryotic cell.

International Patent Publication No. WO99/40185 describes libraries of mutant toxins
in which mutations are introduced into the binding domain to alter the type of cells to which
the toxic species are delivered. The new proteins are derived by mutating a binding subunit
of the wild-type heteromeric cytotoxic protein to create a library of microorganism
clones producing mutant proteins, which are then screened for the ability to specifically bind
to and kill a target cell type.

US Patent No. 5,552,144 discloses a Shigella-like toxin II variant to which a mutation 
is introduced into the A chain at position 167 to change the amino acid at this position to one 
with a different charge. This resulted in a toxin with less of the enzymatic activity associated
with toxicity. 

US Patent No. 6,593,132 describes recombinant toxic proteins which are specifically
toxic to diseased cells but do not depend for their specificity of action on a specific cell
binding component. The recombinant proteins of the '132 patent have an A chain of a
ricin-like toxin linked to a B chain by a synthetic linker sequence which may be cleaved
specifically by a protease localized in cells or tissues affected by a specific disease to liberate
the toxic A chain, thereby selectively inhibiting or destroying the diseased cells or tissues.

US Patent No. 6,649,742 discloses Type 1 ribosome-inactivating proteins (RIPs) and 
analogs of the RIPs having a cysteine available for disulfide bonding to targeting molecules.
The RIPs and RIP analogs are used as components of cytotoxic therapeutic agents to 
selectively eliminate any cell type to which the RIP component is targeted by the specific 
binding capacity of the second component of the agent. 

PCT application PCT/CA2004/000433 by the present inventors discloses
combinatorial protein libraries comprising a plurality of protein species, in which each protein
species comprises an A chain of a heteromeric toxic protein, into which an insert has been
introduced.

Summary of the Invention 

An aspect of the present invention is a combinatorial protein library comprising a plurality of
protein species, each protein species comprising an A chain of a toxic protein into which an
insert has been introduced, wherein, (a) the insert is a polypeptide of varying amino acid
sequence having a length of at least 2 amino acid residues; and (b) the insert is introduced
into the protease-sensitive loop of the A chain sequence. In a further embodiment, the library
may comprise up to or over 100 protein species, and may be formed by introducing the insert
into a Shiga-like toxin 1 A chain, for example, between amino acids 242 and 261, such as
between amino acids 245 and 246.

In a further embodiment of the present invention, the insert has a length of 7 amino acids. 

Another aspect of the present invention is a mutant protein comprising an A chain of a toxic
protein into which an insert has been introduced, wherein the insert is a polypeptide of
varying amino acid sequence having a length of at least 2 amino acid residues; and the insert
is introduced into the protease-sensitive loop of the A chain sequence. In a further
embodiment, the A chain can be a Shiga-like toxin 1 A chain. In a further embodiment, the
insert may be introduced between amino acids 242 and 261, such as between amino acids 245
and 246.

A further aspect of the present invention is a method for identifying a ligand that binds to a
specific target/receptor, comprising the steps of: exposing cells known to possess the
target/receptor to members of a combinatorial protein library as described herein; selecting
members of the protein library which are observed to be toxic to the cells; evaluating the
selected members of the protein library to determine the sequence of the inserted region,
whereby a peptide of the sequence of the inserted region is identified as a possible ligand for
a target/receptor on the cell; and further testing peptides of the sequence of the inserted
region to confirm that they are a ligand for the specific target/receptor.

A further aspect of the present invention is a method for isolating a toxin specific for a known
target/receptor comprising the steps of: exposing the target/receptor to a combinatorial protein
library as described herein; and isolating at least one protein species from the combinatorial
protein library captured by binding to the target/receptor.

A further aspect of the present invention is a combinatorial protein library comprising a
plurality of protein species, each protein species comprising a modified A chain of a toxic
protein into which an insert has been introduced, wherein, the modified A chain comprises a
wild-type A chain containing a mutation in at least one cysteine; the insert is a polypeptide of
varying amino acid sequence having a length of at least 2 amino acid residues; and the insert
is introduced into the region which would be a protease-sensitive loop in the wild-type A
chain sequence.

A further aspect of the present invention is a mutant protein comprising a modified A chain of
a toxic protein into which an insert has been introduced, wherein, the modified A chain
comprises a wild-type A chain containing a mutation in at least one cysteine; the insert is a
polypeptide of varying amino acid sequence having a length of at least 2 amino acid residues;
and the insert is introduced into a region which would be a protease-sensitive loop in the
wild-type A chain sequence. In a further embodiment, the modified A chain further comprises a
label bound to a cysteine.

Another embodiment of the present invention is a polypeptide sequence selected from the
group consisting of the polypeptide sequences listed in Figures 11C, 11D, 12A, 12B, 12C, or
12D.

Another embodiment of the present invention is a method for identifying a ligand that binds to
a specific target/receptor, comprising the steps of: exposing cells known to possess the
target/receptor to members of a combinatorial protein library as described herein; selecting
members of the protein library which are observed to be toxic to the cells; evaluating the
selected members of the protein library to determine the sequence of the inserted region,
whereby a peptide of the sequence of the inserted region is identified as a possible ligand for
a target/receptor on the cell; and further testing peptides of the sequence of the inserted region
to confirm that they are a ligand for the specific target/receptor.

Another embodiment of the present invention is a method for isolating a toxin specific for a
known target/receptor comprising the steps of: exposing the target/receptor to a combinatorial
protein library as described herein; and isolating at least one protein species from the
combinatorial protein library captured by binding to the target/receptor.

Another embodiment of the present invention is a combinatorial protein library comprising a
plurality of protein species, each protein species comprising an A1 chain of a toxic protein
into which an insert has been introduced, wherein, the insert is a polypeptide of varying
amino acid sequence having a length of at least 2 amino acid residues; and the insert is
introduced into the portion of the A1 chain that would be a protease-sensitive loop if the A1
chain was a full-length A chain.

Another embodiment of the present invention is a mutant protein comprising an A1 chain of
a toxic protein into which an insert has been introduced, wherein, the insert is a polypeptide
of varying amino acid sequence having a length of at least 2 amino acid residues; and the
insert is introduced into the portion of the A1 chain that would be a protease-sensitive loop if
the A1 chain was a full-length A chain.

Another embodiment of the present invention is a method for identifying a ligand that binds to
a specific target/receptor, comprising the steps of: exposing cells known to possess the
target/receptor to members of a combinatorial protein library as described herein; selecting
members of the protein library which are observed to be toxic to the cells; evaluating the
selected members of the protein library to determine the sequence of the inserted region,
whereby a peptide of the sequence of the inserted region is identified as a possible ligand for
a target/receptor on the cell; and further testing peptides of the sequence of the inserted region
to confirm that they are a ligand for the specific target/receptor.

Another embodiment of the present invention is a method for isolating a toxin specific for a
known target/receptor comprising the steps of: exposing the target/receptor to a combinatorial
protein library as disclosed herein; and isolating at least one protein species from the
combinatorial protein library captured by binding to the target/receptor.

A further embodiment of the present invention is a combinatorial protein library comprising a
plurality of protein species, each protein species comprising a modified A1 chain of a toxic
protein into which an insert has been introduced, wherein, the modified A1 chain comprises a
wild-type A1 chain containing a mutation in one cysteine; the insert is a polypeptide of
varying amino acid sequence having a length of at least 2 amino acid residues; and the insert
is introduced into the portion of the A1 chain that would be a protease-sensitive loop if the
A1 chain was a full-length A chain.

A further embodiment of the present invention is a mutant protein comprising a modified A1
chain of a toxic protein into which an insert has been introduced, wherein, the modified A1
chain comprises a wild-type A1 chain containing a mutation in one cysteine; the insert is a
polypeptide of varying amino acid sequence having a length of at least 2 amino acid residues;
and the insert is introduced into the portion of the A1 chain that would be a protease-sensitive
loop if the A1 chain was a full-length A chain.

A further embodiment of the present invention is a method for identifying a ligand
that binds to a specific target/receptor, comprising the steps of: exposing cells known to
possess the target/receptor to members of a combinatorial protein library as disclosed herein;
selecting members of the protein library which are observed to be toxic to the cells;
evaluating the selected members of the protein library to determine the sequence of the
inserted region, whereby a peptide of the sequence of the inserted region is identified as a
possible ligand for a target/receptor on the cell; and further testing peptides of the sequence of
the inserted region to confirm that they are a ligand for the specific target/receptor.

A further embodiment of the present invention is a method for isolating a toxin specific for a
known target/receptor comprising the steps of: exposing the target/receptor to a combinatorial
protein library as disclosed herein; and isolating at least one protein species from the
combinatorial protein library captured by binding to the target/receptor.

In another aspect of the present invention, the combinatorial protein library can have the
insert introduced at the N-terminus of the A chain sequence instead of in the protease-
sensitive loop region.

Another embodiment of the present invention is a method for identifying a ligand that binds to
a specific target/receptor, comprising the steps of: exposing cells known to possess the
target/receptor to members of a combinatorial protein library as disclosed herein; selecting
members of the protein library which are observed to be toxic to the cells; evaluating the
selected members of the protein library to determine the sequence of the inserted region,
whereby a peptide of the sequence of the inserted region is identified as a possible ligand for
a target/receptor on the cell; and further testing peptides of the sequence of the inserted region
to confirm that they are a ligand for the specific target/receptor.

Another embodiment of the present invention is a method for isolating a toxin specific for a
known target/receptor comprising the steps of: exposing the target/receptor to a combinatorial
protein library as disclosed herein; and isolating at least one protein species from the
combinatorial protein library captured by binding to the target/receptor.

Brief Description of the Drawings

Fig. 1 is a schematic diagram depicting the A1 and A2 domains of wild-type SLT-1.
The A chain is composed of 293 amino acids. The chain is cleaved by furin to produce an A1
catalytic fragment and an A2 C-terminal tail non-covalently associated with the B pentamer.
A protease-sensitive loop (striped area) is defined by the only two cysteine residues in the A
chain (Cys 242 and 261). Tyr77, Glu167, Arg170 and Trp203 represent residues important
for the catalytic activity of the A1 domain (arrows).

Fig. 2A is a schematic representation of the SLT-1 A chain (1-293) with the breast
cancer-associated MUC1 epitope PDTRPAP (control sequence recognized by the mAb Onc
M27) inserted between residues 245 and 246 and a 6-Histidine tag followed by a protease
cleavage site (such as a furin or thrombin cleavage site) at its N-terminus.

Fig. 2B is a depiction of a SLT-1 A chain-tripeptide library construction where the
three positions of the MUC1 epitope recognized by the mAb Onc M27 are randomized (XXX
region). The tripeptide library was inserted in a naturally occurring loop region of the A
chain created by the presence of a disulfide bridge between Cys 242 and Cys 261.

Fig. 2C is a depiction of the SLT-1 A chain tripeptide library construction of Figure
2B, wherein cysteine 261 has been mutated to an alanine. Note that in this case (as in many of
the figures below) the disulfide bridge present in Fig. 2B cannot be formed due to the
absence of one of the cysteines.

Figure 2D is a depiction of the SLT-1 A chain tripeptide library construction of Figure
2B, wherein cysteine 242 has been mutated to an alanine.

Figure 2E is a depiction of the SLT-1 A chain tripeptide library construction of Figure
2B, wherein cysteine 242 and cysteine 261 have both been mutated to alanine residues.

Figure 2F is a depiction of the SLT-1 A chain tripeptide library construction of Figure
2C, wherein the C242 residue has been biotinylated.

Figure 2G is a schematic representation of the SLT-1 A1 chain (1-251) with the breast
cancer-associated MUC1 epitope PDTRPAP (control sequence recognized by the mAb Onc
M27) inserted between residues 245 and 246 and a Histidine tag followed by a protease
cleavage site (such as a furin or thrombin cleavage site) at its N-terminus.

Fig. 2H is a depiction of our SLT-1 A1 chain-tripeptide library construction where
the three key positions of the MUC1 epitope recognized by the mAb Onc M27 were
randomized (XXX region).

Fig. 2I is a depiction of the SLT-1 A1 chain tripeptide library construction of Figure
2H, wherein cysteine 242 has been mutated to an alanine.

Figure 2J is a depiction of the SLT-1 A1 chain tripeptide library construction of
Figure 2H, wherein cysteine 242 has been biotinylated.

Fig. 3 shows a representative ELISA data set from screening 96 distinct single A
chain variants from our SLT-1 A chain-tripeptide library with mAb Onc M27. Toxin variant
#41 (Tables 1 and 2) gave a strong ELISA signal and had the expected epitope.

Fig. 4A shows a schematic diagram of a 7-amino acid random segment inserted 
between residues 245 and 246 of the A chain. 

Fig. 4B shows a schematic diagram of a 7-amino acid random segment inserted
between residues 245 and 246 of the A chain, wherein the cysteine at position 261 has been
mutated to an alanine.

Fig. 4C shows a schematic diagram of a 7-amino acid random segment inserted
between residues 245 and 246 of the A chain, wherein the cysteine at position 242 has been
mutated to an alanine.

Fig. 4D shows a schematic diagram of a 7-amino acid random segment inserted
between residues 245 and 246 of the A chain, wherein the cysteines at positions 242 and 261
have been mutated to alanines.

Fig. 4E shows a schematic diagram of a 7-amino acid random segment inserted
between residues 245 and 246 of the A chain, wherein the cysteine at position 261 has been 
mutated to an alanine and the cysteine at position 242 has been biotinylated. 

Fig. 4F shows a schematic diagram of a 7-amino acid random segment inserted 
between residues 245 and 246 of the A1 chain.

Fig. 4G shows a schematic diagram of a 7-amino acid random segment inserted
between residues 245 and 246 of the A1 chain, wherein the cysteine residue at position 242
has been biotinylated.

Fig. 4H shows a schematic diagram of a 7-amino acid random segment inserted
between residues 245 and 246 of the A1 chain, wherein the cysteine residue at position 242
has been mutated to an alanine.

Fig. 5 shows the results of tests on seven toxin variants that were identified as
repeatable killers of the human melanoma cell line 518A2. The abscissa represents the log
concentration of toxin used to treat the cells and the ordinate depicts the observed percentage
of cells that are viable after 48 hours. The closed triangles depict the effect of the wild-type
toxin on 518A2 cells while the two most efficacious A chain variants were termed SAM#3
(open squares) and SAM#5 (X symbols).

Fig. 6 shows the amino acid sequence corresponding to the wild-type SLT-1 protein of
Figure 1. Amino acid sequences are shown from N-terminus to C-terminus.

Figure 7A shows the amino acid sequence corresponding to the SLT-A chain of
Figure 2A, less the N-terminus tags (the His-tag and the protease cleavage site).

Figure 7B shows the amino acid sequence corresponding to the SLT-A chain of
Figure 2A, with an example of one possible N-terminus tag, comprising an 8-His-tag and a
protease cleavage site.

Figure 7C shows the amino acid sequence corresponding to the SLT-A chain-
tripeptide library construction of Figure 2B, less the N-terminus tags (the His-tag and the
protease cleavage site).

Figure 7D shows the amino acid sequence corresponding to the SLT-A chain-
tripeptide library construction of Figure 2B, with an example of one possible N-terminus tag,
comprising an 8-His-tag and a protease cleavage site.

Figure 8A shows the amino acid sequence corresponding to the SLT-1 A chain
tripeptide library of Figure 2C and 2F, less the N-terminus tags (the His-tag and the protease
cleavage site).

Figure 8B shows the amino acid sequence corresponding to the SLT-1 A chain
tripeptide library of Figure 2C and 2F, with an example of one possible N-terminus tag,
comprising an 8-His-tag and a protease cleavage site.

Figure 8C shows the amino acid sequence corresponding to the SLT-1 A chain
tripeptide library of Figure 2D, less the N-terminus tags (the His-tag and the protease
cleavage site).

Figure 8D shows the amino acid sequence corresponding to the SLT-1 A chain 
tripeptide library of Figure 2D, with an example of one possible N-terminus tag, comprising a 
8-His-tag and a protease cleavage site.

Figure 9A shows the amino acid sequence corresponding to the SLT-1 A chain
tripeptide library of Figure 2E, less the N-terminus tags (the His-tag and the protease cleavage
site).

Figure 9B shows the amino acid sequence corresponding to the SLT-1 A chain
tripeptide library of Figure 2E, with an example of one possible N-terminus tag, comprising a 
8-His-tag and a protease cleavage site. 

Figure 9C shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 2G, less the N-terminus tags (the His-tag and the protease cleavage site).

Figure 9D shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 2G, with an example of one possible N-terminus tag, comprising a 8-His-tag and a 
protease cleavage site. 

Figure 10A shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 2H and 2J, less the N-terminus tags (the His-tag and the protease cleavage site).

Figure 10B shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 2H and 2J, with an example of one possible N-terminus tag, comprising an 8-His-tag
and a protease cleavage site.

Figure 10C shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 2I, less the N-terminus tags (the His-tag and the protease cleavage site).

Figure 10D shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 2I, with an example of one possible N-terminus tag, comprising an 8-His-tag and a
protease cleavage site.



WO 2007/033497    PCT/CA2006/001581



Figure 11A shows the amino acid sequence corresponding to the SLT-1 A chain of
Figure 4A, less the N-terminus tags (the His-tag and the protease cleavage site).

Figure 11B shows the amino acid sequence corresponding to the SLT-1 A chain of
Figure 4A, with an example of one possible N-terminus tag, comprising an 8-His-tag and a
protease cleavage site.

Figure 11C shows the amino acid sequence corresponding to the SLT-1 A chain of
Figure 4B and 4E, less the N-terminus tags (the His-tag and the protease cleavage site). 

Figure 11D shows the amino acid sequence corresponding to the SLT-1 A chain of
Figure 4B and 4E, with an example of one possible N-terminus tag, comprising an 8-His-tag
and a protease cleavage site.

Figure 12A shows the amino acid sequence corresponding to the SLT-1 A chain of
Figure 4C, less the N-terminus tags (the His-tag and the protease cleavage site). 

Figure 12B shows the amino acid sequence corresponding to the SLT-1 A chain of
Figure 4C, with an example of one possible N-terminus tag, comprising a 8-His-tag and a 
protease cleavage site. 

Figure 12C shows the amino acid sequence corresponding to the SLT-1 A chain of
Figure 4D, less the N-terminus tags (the His-tag and the protease cleavage site). 

Figure 12D shows the amino acid sequence corresponding to the SLT-1 A chain of
Figure 4D, with an example of one possible N-terminus tag, comprising an 8-His-tag and a
protease cleavage site.

Figure 13A shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 4F and 4G, less the N-terminus tags (the His-tag and the protease cleavage site).

Figure 13B shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 4F and 4G, with an example of one possible N-terminus tag, comprising an 8-His-tag
and a protease cleavage site.

Figure 13C shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 4H, less the N-terminus tags (the His-tag and the protease cleavage site).

Figure 13D shows the amino acid sequence corresponding to the SLT-1 A1 chain of
Figure 4H, with an example of one possible N-terminus tag, comprising an 8-His-tag and a
protease cleavage site.

Figure 14A shows the amino acid sequence corresponding to the SLT-1 A region, 
with cysteine 261 mutated to an alanine. 

Figure 14B shows the amino acid sequence corresponding to the SLT-1 A region,
with cysteine 261 mutated to an alanine, and with an example of one possible N-terminus tag,
comprising an 8-His-tag and a protease cleavage site.

Figure 14C shows the amino acid sequence corresponding to the SLT-1 A region,
with cysteine 242 mutated to an alanine. 

Figure 14D shows the amino acid sequence corresponding to the SLT-1 A region,
with cysteine 242 mutated to an alanine, and with an example of one possible N-terminus tag,
comprising an 8-His-tag and a protease cleavage site.

Figure 15A shows the amino acid sequence corresponding to the SLT-1 A region,
with cysteines 242 and 261 mutated to alanines.

Figure 15B shows the amino acid sequence corresponding to the SLT-1 A region,
with cysteines 242 and 261 mutated to alanines, and with an example of one possible
N-terminus tag, comprising an 8-His-tag and a protease cleavage site.

Figure 15C shows the amino acid sequence corresponding to the SLT-1 A1 region.

Figure 15D shows the amino acid sequence corresponding to the SLT-1 A1 region
with an example of one possible N-terminus tag, comprising an 8-His-tag and a protease
cleavage site.

Figure 16A shows the amino acid sequence corresponding to the SLT-1 A1 region,
with cysteine 242 mutated to an alanine.

Figure 16B shows the amino acid sequence corresponding to the SLT-1 A1 region
with cysteine 242 mutated to an alanine, and with an example of one possible N-terminus tag,
comprising an 8-His-tag and a protease cleavage site.

For Figures 6-16 that include the N-terminus tag, the tag can be varied as needed; for
example, the tag KGMRSHHHHHHHHRVARAS can be used instead of
KGMRSHHHHHHHHIEGRAS.
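
The tag substitution described above can be illustrated with a minimal Python sketch. The function name `swap_n_terminal_tag`, the constant names and the placeholder downstream sequence are hypothetical and are not part of the disclosure; the two tag strings are the ones given above, written without internal whitespace.

```python
# The two example tags named in the text (constant names are hypothetical).
TAG_IEGR = "KGMRSHHHHHHHHIEGRAS"
TAG_RVAR = "KGMRSHHHHHHHHRVARAS"


def swap_n_terminal_tag(tagged_seq: str, old_tag: str, new_tag: str) -> str:
    """Return tagged_seq with old_tag at its N-terminus replaced by new_tag."""
    if not tagged_seq.startswith(old_tag):
        raise ValueError("sequence does not start with the expected tag")
    # Keep everything downstream of the old tag unchanged.
    return new_tag + tagged_seq[len(old_tag):]


# Placeholder standing in for an A chain sequence (NOT a real sequence).
placeholder_chain = "X" * 10
retagged = swap_n_terminal_tag(TAG_IEGR + placeholder_chain, TAG_IEGR, TAG_RVAR)
```

Only the N-terminal tag changes; the chain portion downstream of the tag is carried over as-is.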

Detailed Description of the Invention

The present invention relates to a combinatorial protein library comprising a plurality
of protein species, each protein species comprising an A or A1 chain of a heteromeric toxic
protein into which an insert has been introduced. The insert is a polypeptide of varying
amino acid sequence having a length of 2 or more amino acid residues, for example from 3 to
200 amino acid residues; and is introduced into the protease-sensitive loop of the A chain
sequence, or the equivalent remaining area in the A1 chain sequence. The library provides a
collection of protein species that can be screened for individual proteins that are toxic against
specific cell types, such as specific cancer cell types. Individual protein species thus selected
are suitably used in the treatment of the cancer.

As used in the specification and claims of this application, the term "combinatorial
library" refers to a mixture of species each of which has a common portion and a variable
portion. In the case of a "combinatorial protein library" each of the species is a protein or
peptide, and the common portions and the variable portions are each amino acid sequences.
In the case of a combinatorial expression library, the species are microorganisms, expression
vectors or polynucleotides which, when expressed, produce proteins or peptides having
common portions and variable portions. In this case, the common portions and the variable
portions are each nucleotide sequences. Since the purpose of the combinatorial library is to
provide multiple variants for screening purposes, the combinatorial library preferably
contains at least 100 distinct species of protein or expression unit, more preferably at least
1000 distinct species.
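
To give a sense of scale for the species counts mentioned above, the following Python sketch computes the theoretical number of distinct inserts of a given length. It assumes, as a simplification not stated in the text, that every insert position is randomized independently over the 20 standard amino acids; a physical library built from degenerate codons may contain fewer or differently distributed species. The function names are hypothetical.

```python
from itertools import product

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def theoretical_diversity(insert_length: int) -> int:
    """Number of distinct inserts if each position can be any of 20 residues."""
    return len(AMINO_ACIDS) ** insert_length


def enumerate_inserts(insert_length: int):
    """Yield every possible insert of the given length (practical for short inserts only)."""
    for combo in product(AMINO_ACIDS, repeat=insert_length):
        yield "".join(combo)


# A fully randomized tripeptide region (the XXX region of the figures).
tripeptides = list(enumerate_inserts(3))
```

Under this assumption a tripeptide insert gives 20^3 = 8,000 possible species and a 7-residue insert gives 20^7 = 1.28 x 10^9, both well above the preferred minimum of 1000 distinct species.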

As used in the specification and claims of this application, the term "heteromeric toxic
protein" refers to the class of protein toxins with the common organization theme of being
heteromeric in nature with two or more polypeptide domains or subunits responsible for
distinct functions (Merritt, E.A., and Hol, W.G.J. (1995) Curr. Opin. Struct. Biol. 5:165-171). In
such proteins, the two or more subunits or domains could be referred to as A and B, and the
toxins as ABx toxins where x represents the number of identical or homologous B subunits in
the toxin. This family of framework-related toxins includes examples such as Shiga and
Shiga-like toxins, the E. coli heat-labile enterotoxins, cholera toxin, diphtheria toxin,
pertussis toxin, Pseudomonas aeruginosa exotoxin A as well as plant toxins such as ricin and
abrin. Based on their ability to block protein synthesis, proteins such as Shiga and Shiga-like
toxins as well as ricin, abrin, gelonin, crotin, pokeweed antiviral protein, saporin, momordin,
modeccin, sarcin, diphtheria toxin and exotoxin A have been referred to as
ribosome-inactivating proteins (RIP). In these naturally-occurring heteromeric toxic proteins, the A
chain is the toxic portion, while the B chains form a binding moiety which binds to a receptor
on a cell susceptible to the toxin, thereby delivering the A chain to the cell. In many cases,
the A chain has been further characterized as having two domains, an A1 domain and an A2
domain. Typically, the A1 domain includes the cytotoxic elements of the A subunit and a
portion of the protease-sensitive loop portion, but often does not include the B chain binding
portion.

One specific example of the A chain of a heteromeric toxic protein is the A chain of
SLT-1, which has the sequence given in Seq. ID No. 1. The A chain of SLT-1 comprises
293 amino acids with the enzymatic (toxic) domain spanning residues 1 to 239. A protease
sensitive loop encompassing residues 242 to 261 is normally exposed, and is a suitable site
for inserting a peptide sequence. The A chain of SLT-1 is characterized as having an A1
domain, spanning residues 1 to 251, and including the entire enzymatic (toxic) domain, and
an A2 domain spanning residues 252-261.
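
The residue numbering above can be made concrete with a short sketch. The following Python fragment is illustrative only: it records the region boundaries stated in the text and shows how a peptide could be placed between two residues of the protease-sensitive loop. The function names, the placeholder sequence (a run of "X", not Seq. ID No. 1) and the choice of site 245 (the site named later in this specification) are assumptions made for the illustration.

```python
# Region boundaries (1-based, inclusive) as stated in the text for the SLT-1 A chain.
A_CHAIN_LENGTH = 293
ENZYMATIC_DOMAIN = (1, 239)
PROTEASE_SENSITIVE_LOOP = (242, 261)
A1_DOMAIN = (1, 251)


def in_region(residue: int, region: tuple) -> bool:
    """True if a 1-based residue number lies inside an inclusive region."""
    start, end = region
    return start <= residue <= end


def insert_after(sequence: str, residue: int, insert: str) -> str:
    """Place `insert` between `residue` and `residue + 1` (1-based numbering).

    Hypothetical helper: it only accepts sites whose two flanking residues
    both lie inside the protease-sensitive loop.
    """
    if not (in_region(residue, PROTEASE_SENSITIVE_LOOP)
            and in_region(residue + 1, PROTEASE_SENSITIVE_LOOP)):
        raise ValueError("insertion site is outside the protease-sensitive loop")
    return sequence[:residue] + insert + sequence[residue:]


# Placeholder sequence of the right length; NOT the real SLT-1 A chain sequence.
placeholder_a_chain = "X" * A_CHAIN_LENGTH

# Insert the tripeptide Thr-Arg-Pro (one-letter codes T, R, P) after residue 245.
variant = insert_after(placeholder_a_chain, 245, "TRP")
print(len(variant), variant[240:252])
```

Because residue 245 is also inside the A1 domain (residues 1 to 251), the same site is available whether the full A chain or only the A1 chain is used as the scaffold.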

SLT-1 is a type II ribosome inactivating protein produced by pathogenic strains of
Escherichia coli (O157:H7) (24). SLT-1 is an AB5 complex of about 70 kD (O'Brien, A. D.,
and Holmes, R. K. (1987) Shiga and Shiga-like toxins. Microbiol Rev 51, 206-220). The
single 32 kD catalytic A subunit is non-covalently associated with a pentamer of five identical
7.7 kD B subunits. The B subunit pentamer recognizes the glycolipid globotriaosylceramide
(also known as CD77 or Gb3) on the surface of target cells (Lingwood, C. A. (1993)
Verotoxins and their glycolipid receptors. Adv Lipid Res 25, 189-211; Jacewicz, et al. (1986)
Pathogenesis of shigella diarrhea. XI. Isolation of a shigella toxin-binding glycolipid from
rabbit jejunum and HeLa cells and its identification as globotriaosylceramide. J Exp Med
163, 1391-1404). A protease-sensitive loop located between Cys242 and Cys261 at the C
terminus of the A chain is cleaved by furin during cellular routing (Fig. 1). The A chain
remains associated with its B subunit pentamer due to an intrachain disulfide bond between
Cys242 and Cys261 as it travels to the ER lumen (Sandvig, et al. (1989) Endocytosis from
coated pits of Shiga toxin: a glycolipid-binding protein from Shigella dysenteriae 1. J Cell
Biol 108, 1331-1343; Garred, et al. (1995) Role of processing and intracellular transport
for optimal toxicity of Shiga toxin and toxin mutants. Exp Cell Res 218, 39-49). The
disulfide bond is finally reduced in the ER lumen and the A1 chain (first 251 aa) is released
and subsequently retrotranslocated to the cytosol where it inactivates ribosomes (O'Brien, et
al. (1992) Shiga toxin: biochemistry, genetics, mode of action, and role in pathogenesis. Curr
Top Microbiol Immunol 180, 65-94). More specifically, the A chain of SLT-1 is an
N-glycosidase that catalytically cleaves a specific adenine nucleotide (4324) from the 28S
rRNA (Brigotti, et al. (1997) The RNA-N-glycosidase activity of Shiga-like toxin I: kinetic
parameters of the native and activated toxin. Toxicon 35, 1431-1437). This event leads to
the inhibition of protein synthesis by preventing the binding of aminoacyl tRNAs to the
ribosome and halting protein elongation. Mutagenesis studies as well as structural analysis
performed on the A chains of ST and ricin have delineated key conserved residues involved
in catalytic activity (Deresiewicz, et al. (1992) Mutations affecting the activity of the
Shiga-like toxin I A-chain. Biochemistry 31, 3272-3280; Ready, et al. (1991) Site-directed
mutagenesis of ricin A-chain and implications for the mechanism of action. Proteins 10,
270-278). Residues important for catalytic activity of SLT-1 are tyrosine 77, glutamic acid
167, arginine 170 and tryptophan 203 (Hovde, et al. (1988) Evidence that glutamic acid 167
is an active-site residue of Shiga-like toxin I. Proc Natl Acad Sci U S A 85, 2568-2572;
Yamasaki, et al. (1991) Importance of arginine at position 170 of the A subunit of Vero toxin
1 produced by enterohemorrhagic Escherichia coli for toxin activity. Microb Pathog 11, 1-9).
In addition, binding of the toxin to the cell surface is important to introduction into the cell
and thus for toxic activity. Because of this, the A chain alone is not significantly toxic.

In addition to the A or A1 chain of SLT-1, other toxins may also be used to form
libraries and compositions in accordance with the invention, and may be used in the methods
of the invention. Specifically, Shiga and other Shiga-like toxins, as well as ricin, abrin,
gelonin, crotin, pokeweed antiviral protein, saporin, momordin, modeccin, sarcin, diphtheria
toxin and exotoxin A, and other functionally-related ribosome inactivating proteins (RIP)
may be used.

For purposes of making the combinatorial library of the invention, short amino acid
sequences of at least 2 amino acids, for example of 3 to 200 amino acid residues in length,
are inserted into this protease sensitive loop. The number of amino acids in the insert defines
the number of possible random variants that can be in the library. For example, when the
number of amino acids in the insert is 3, the maximum number of variants is 20^3 or 8000
variants. Larger inserts provide correspondingly larger numbers of possible variants. It has
also been found that certain other regions of the A or A1 chain may be used to insert the short
amino acid sequences. The present inventors have shown, for example, that the N terminus
region can support insertions of long amino acid sequences without significant loss of
activity.
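
The relationship between insert length and library size is simple exponentiation, and can be sketched as follows. This is a minimal illustration assuming that every position of the insert may be any of the 20 standard amino acids; the function name `max_variants` is hypothetical and does not come from the specification.

```python
STANDARD_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes, 20 residues


def max_variants(insert_length: int,
                 alphabet_size: int = len(STANDARD_AMINO_ACIDS)) -> int:
    """Upper bound on distinct inserts: alphabet_size ** insert_length."""
    if insert_length < 1:
        raise ValueError("insert length must be at least 1")
    return alphabet_size ** insert_length


# Tripeptide and heptapeptide inserts, the two lengths used in the examples.
for length in (3, 7):
    print(length, max_variants(length))
```

For a 3-residue insert this gives 8000, the figure quoted above. The count grows by a factor of 20 with every added residue, which is why longer inserts quickly exceed the number of clones that can be screened one by one.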

As an alternative to using inserts with purely random sequences, inserts can be
designed based on a known template. For example, as described below in the context of
Muc-1, variations in a sequence known to provide receptor binding properties for a particular
cell type can be used to identify an insert that provides for optimization of the toxic properties
of the protein construct. This same optimization may be performed on an individual
sequence isolated by screening of a larger combinatorial library. It will be appreciated,
however, that the proof of principle tests of Example 1 use an insert which is the
target/receptor, while in the actual case the insert would be based on the sequence of a known
ligand to be optimized for maximum effectiveness and specificity.

This approach, in which a specific, known receptor type is targeted, illustrates a further
aspect of the invention, namely a method for identifying peptide ligands that bind to specific
targets/receptors, such as tumor markers known to exist on cancer cells. In this method, a
combinatorial protein library in accordance with the invention is screened against cells known
to possess the target/receptor. The toxin serves as the reporter, such that proteins which are
shown to be toxic to the cells are evaluated to determine the sequence of the inserted region.
Peptides of this sequence can then be used, in combination with a toxin or other molecules, to
direct compounds to cells possessing the target/receptor. Further testing of peptides of the
sequence of the inserted region may be appropriate to confirm that they are a ligand for the
specific target/receptor, as opposed to some other receptor on the cell type. This can be done
using binding assays with isolated receptor, where such are available.

These inserts, used as an alternative to inserts with purely random sequences, can use, as a
known template, a small protein molecule with a known or desirable folded structure, for
increasing sensitivity or specificity of binding. For example, an immunoglobulin-like
domain, such as an FC molecule sequence, a fragment of an FC molecule sequence, or a mimic
of an FC molecule sequence, can be inserted. Libraries can therefore be created with a
variable chain domain region of an FC molecule, or a random 7-mer (or other length random
sequence) inserted in the variable region of an FC molecule, in turn inserted in the
protease-sensitive loop region of the A or A1 molecule.

The invention also provides a method for identifying toxic substances specific for a 
known cell marker, and particularly markers that are available in isolated form. In this 
embodiment of the invention, the toxin need not serve as reporter. Thus, cells having the 
marker, or an isolated target/receptor, where available, are exposed to the combinatorial 
protein library. In preferred embodiments, the cells or the isolated target/receptor are
immobilized on a solid support, such as in plastic wells. Captured proteins from the library
are then rescreened against cells to confirm their toxicity and specificity for cells expressing
the target/receptor, and their suitability for use as a therapeutic. This method can be used to
identify toxins with binding inserts specific for any tumor marker or cell receptor, including
without limitation tumor markers such as mucins such as MUC-1 and its glycoforms, Her-2,
Her2-Neu, tyrosine kinase markers, EGFR, GD2, and GD3.

Thus, in accordance with this specific aspect of the invention, a method for isolating
a toxin specific for a known target/receptor is provided that comprises the steps of: 

(a) exposing the target/receptor to a combinatorial protein library comprising a
plurality of protein species, each protein species comprising an A or A1 chain of a toxic
protein into which an insert has been introduced, wherein,

the insert is a polypeptide of varying amino acid sequence having a length of at
least 2 amino acid residues; and

the insert is introduced into the protease-sensitive loop of the A chain
sequence or the equivalent portion of the A1 chain sequence, and

(b) isolating at least one protein species from the combinatorial protein library
captured by binding to the target/receptor. As used in the specification and claims hereof,
the term "isolating" refers to any mechanism for obtaining a composition containing the
protein separated from the milieu in which it is expressed in a form suitable for further
analysis. This would include release from the target/receptor following capture (for example
by exposure to a competitive binding agent) or isolation from a culture of a clone expressing
the protein found to be captured. The method may further comprise the step of screening
the isolated protein against cells expressing the target/receptor, to confirm its toxicity for
cells expressing the target/receptor. Procedures suitable for this screening are described in
Example 3. As noted above, in this method, the target/receptor may be a purified
target/receptor and may be immobilized on a solid support. The target/receptor may also be
on the surface of cells, which may be immobilized. Where the target/receptor is on the
surface of cells, the toxin can serve as a reporter, and the death of the cells is indicative of
receptor binding.

Our experience with constructing SLT-1 libraries has pointed out a number of
practical issues that are appropriately considered in selecting a protein template for building
combinatorial libraries. An important factor is to choose a single chain protein, preferably a
bacterial protein of less than 300 amino acids (if the libraries are to be expressed in
prokaryotes). Use of a smaller toxin both increases its potential to penetrate into solid tumors
and reduces its immunogenicity. Secondly, the protein template should spontaneously fold in
solution into its active form such that there is minimal need for the host's chaperones. One
should avoid, for example, scaffolds that contain multiple cysteine residues normally involved
in disulfide bridges. In addition, a single chain protein as opposed to a multi-subunit complex
may be more easily exported from bacteria. Thirdly, the protein template should possess an
enzymatic activity, which can rapidly be measured to confirm the proper folding of peptide
variants containing single and multiple site-directed mutations. Fourthly, a simple screening
approach should be incorporated into the design of combinatorial libraries. Such searches
should be amenable to high-throughput screening methods. The catalytic A chain of SLT-1
(residues 1 to 293) and the A1 chain of SLT-1 (residues 1-251) meet these criteria because
they are single chains that lack any known receptor binding function, and that have well
defined structures and catalytic sites.

We have also found that cysteine 242 and 261 of the A chain of SLT-1 can be
replaced by site specific mutagenesis (or otherwise modified) in order to remove the single
disulfide bridge present between these two cysteines in the A chain. Such advantageous
mutations remove the need to reduce the peptide chain during purification and provide a
simpler strategy for purifying single chain toxins comprising the A1 domain of A chain
mutants. We have found that either cysteine 242 or cysteine 261, or both, can be mutated in
this manner. The cysteine(s) can be mutated to alanine, though other mutations may result in
the same desired effect while providing a suitable A or A1 chain.

Another advantage of mutating the cysteines in this manner results from the mutation
of either of C242 or C261 (preferably, C261). This results in a Shiga-like protein A chain
with only one cysteine. The thiol group associated with this single cysteine residue is
therefore free and offers the opportunity to introduce a spectroscopic (derivatives of fluorescent
chromophores that react with the thiol moiety), radioactive (thiol-reactive heavy ion
complexes or metal chelators), or biochemical (for example, biotin) probe. These A chain
variants, labeled with reporter probes, can then be used as agents for both in vitro and in vivo
applications. For example, an inactivated single chain A or A1 variant, labeled with a
reporter probe (and inactivated by mutating E167A and R170A) can be used as a non-toxic
probe, as described above, to determine which patient population may benefit from being
treated with which of the corresponding catalytically active single chain variants. A library of
inactivated single chain A1 variants labeled in this manner can also be used to determine
which binding domains (from said library) are effective, to determine which binding domains
can be used in corresponding catalytically active single chain variants to treat a patient.
Alternatively, these non-toxic probes can be used to determine which binding domains (from
the randomly mutated library) can be used to target the desired cell population and used in
either the toxic subunit, or bound to a separate and known therapeutic agent, to target and
treat that cell population. Alternatively, once a specific desired library member has been
identified, either through binding studies as described above or otherwise, that one specific
library member (for example, an A1 chain with a specific desirable 7-mer sequence inserted
after amino acid 245) can be biotinylated for further purification or other studies.

Alternatively, the entire library can be biotinylated, allowing the library to be assayed
with a receptor based assay rather than a toxin based assay.

We have also surprisingly found that the use of the A1 domain of the A chain is
sufficient for performing the assays described herein. Although the A1 domain contains the
entirety of the cytotoxic domain of the A subunit, it was not known until now what function
was served by the A2 subunit in assays of this kind, and whether the A1 subunit could be
successfully used on its own in this type of assay. By removing the A2 portion of the A chain,
a smaller, more easily manipulated fragment can be used in the assays described herein.

Combining the unexpected finding that cysteine mutations result in a functional A
subunit useful in the assays described herein with the finding that the A1 subunit can be used
alone yields a further advantage. By mutating one of the
cysteines (for example, C261), one can make a protein that cannot form the protease-sensitive
A chain loop (since this protease-sensitive A chain loop is a result of the cysteine-cysteine
interactions between C242 and C261). The A subunit can therefore be cleaved more
efficiently into A1 and A2 subunits. Thus mutation of at least one of C242 and C261,
combined with utilizing the A1 subunit alone, provides a simpler and more elegant strategy
for purifying single chain toxins comprising the A1 domain of A chain mutants.
Manufacturing, and purifying, a protein or a protein library of use in the assays described
herein thus becomes much more efficient by either utilizing the A1 domain alone, at least one
cysteine mutation, or both.

A further aspect of the invention is a combinatorial expression library comprising a
plurality of species of expression systems. Each species within the expression library
expresses a protein species comprising an A or A1 chain of a heteromeric toxic protein into
which an insert has been introduced. The insert is a polypeptide of varying amino acid
sequence having a length of 2 or more amino acid residues, for example from 3 to 200 amino
acid residues; and is introduced into the protease-sensitive loop of the A chain sequence.
Suitable expression systems include plasmids and viral vectors.

In a further aspect of the invention, the A or A1 chain can be pegylated or otherwise
modified by a method known in the art in order to stabilize or modify the biological half-life
of the protein. Reactive pegylation agents can be bound to a lysine, to the N-terminus, or to
cysteine 242 or 261.

We have also found that adding a histidine tag (for example, a 6 or 8 histidine tag) to
the N-terminus of the A or A1 chain is advantageous as it allows for improved purification of
the desired protein. Optionally, a protease-sensitive region can be inserted between the
histidine tag and the rest of the A or A1 chain in order to facilitate cleavage of the histidine
tag from the A or A1 chain once it is no longer desired. Examples of protease-sensitive
regions that may be used include a thrombin sensitive region or a furin sensitive region.
Other protease-sensitive sequences are also well known in the art.

The invention will now be further described with reference to the following non-limiting
examples.

Example 1 

Proof-of-concept: design and mining of a prototypic A chain-tripeptide library

We originally created a simple tripeptide library inserted in the C-terminal
protease-sensitive loop of the SLT-1 A chain (Figs. 2A and B). This A chain loop region is
naturally constrained due to the presence of a single disulfide bond bridging Cys242 to
Cys261. The maximal diversity of this library can thus be calculated to be 20^3 or 8000
permutations of a tripeptide sequence. As a proof-of-concept that A chain libraries can easily
be screened for a new receptor-binding activity, we picked more than 3000 colonies from this
A chain-tripeptide library and purified the mutant toxin produced by each clone. We noticed
very early in this study that the level of expression of A chain mutant was dramatically
increased when expressed in the presence of the wild-type SLT-1 B subunit. Thus the mutant
forms of A chain were expressed and initially purified as AB5 toxin variants. Since all A
subunits harbor a polyHis purification tag, it is relatively easy to remove the B subunit with
denaturants (urea for example) while recovering the A chain on metal-affinity columns or
beads. Western blots performed on randomly selected bacterial clones indicated that > 70% of
these colonies produced significant amounts of these A chain mutants.

These toxin variants were then coated in individual wells of 96-well plates and
screened by ELISA for their ability to bind the monoclonal antibody Onc M27 (Linsley, et
al. (1988) Monoclonal antibodies reactive with mucin glycoproteins found in sera from breast
cancer patients. Cancer Res 48, 2138-2148), directed at the well-characterized breast cancer
tripeptide epitope Thr-Arg-Pro of the human MUC1 tandem repeat (Gendler, et al. (1988) A
highly immunogenic region of a human polymorphic epithelial mucin expressed by
carcinomas is made up of tandem repeats. J Biol Chem 263, 12820-12823; Girling, et al.
(1989) A core protein epitope of the polymorphic epithelial mucin detected by the
monoclonal antibody SM-3 is selectively exposed in a range of primary carcinomas. Int J
Cancer 43, 1072-1076). As shown in Tables 1 and 2, most A chain mutants neither coded for
the tripeptide insert that matched the targeted epitope of Onc M27 nor were recognized by the
antibody. However, on two occasions, a toxin variant harboring the exact epitope (Table 1,
Fig. 3) gave a strong ELISA signal comparable to the one observed with our control A chain
harboring the MUC1 tripeptide epitope and had the expected epitope sequence in its
randomized tripeptide region. A typical ELISA data set for 96 A chain mutants is presented in
Fig. 3, highlighting the fact that the majority of A chain mutants were not recognized by the mAb
Onc M27 except for an A chain variant (mutant #41 in Tables 1 and 2). These results clearly
established that A chain libraries can easily be constructed and screened to find toxin variants
able to specifically target a given receptor, in this case an antigen-combining site.
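
To put the screening depth of this example in perspective, the chance of encountering one particular tripeptide can be estimated with a short calculation. The sketch below is illustrative only. It assumes that every one of the 8000 tripeptides is equally likely in each clone and that clones are independent, which is a simplification: the codon scheme used for the tripeptide library is not stated here, and only about 70% of clones are reported to express significant protein. The function names are hypothetical.

```python
def expected_copies(library_size: int, clones_screened: int) -> float:
    """Expected number of screened clones carrying one given variant."""
    return clones_screened / library_size


def probability_at_least_one(library_size: int, clones_screened: int) -> float:
    """Chance that a given variant appears at least once among the clones,
    assuming uniform and independent sampling (an idealization)."""
    return 1.0 - (1.0 - 1.0 / library_size) ** clones_screened


LIBRARY_SIZE = 20 ** 3   # tripeptide insert, 20 amino acids per position
CLONES_SCREENED = 3000   # lower bound on the colonies picked in this example

print(expected_copies(LIBRARY_SIZE, CLONES_SCREENED))
print(probability_at_least_one(LIBRARY_SIZE, CLONES_SCREENED))
```

Under these assumptions a given tripeptide is expected well under once per 3000 clones, so recovering the Thr-Arg-Pro epitope a small number of times is in the range one would anticipate. In a codon-randomized library the true frequencies differ, because amino acids encoded by more codons are over-represented.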



Example 2

Proof-of-concept: design and mining of a prototypic A1 chain-tripeptide library

A tripeptide library inserted in the C-terminal SLT-1 A1 domain is prepared. A
simple tripeptide library is inserted in the C-terminal protease-sensitive loop of the SLT-1 A
chain (Figs. 2A and B). This A chain loop region is naturally constrained due to the presence
of a single disulfide bond bridging Cys242 to Cys261. The maximal diversity of this library
can thus be calculated to be 20^3 or 8000 permutations of a tripeptide sequence. A chain
libraries can easily be screened for a new receptor-binding activity. Over 3000 colonies from
this A chain tripeptide library are picked and the mutant toxin produced by each clone is
purified. The level of expression of A chain mutant is dramatically increased when expressed
in the presence of the wild-type SLT-1 B subunit. Thus the mutant forms of A chain are
expressed and initially purified as AB5 toxin variants. Since all A subunits harbor a polyHis
purification tag, it is relatively easy to remove the B subunit with denaturants (urea for
example) while recovering the A chain on metal-affinity columns or beads. Western blots
performed on randomly selected bacterial clones indicate that > 70% of these colonies
produce significant amounts of these A chain mutants. The A1 subunit is then excised from
the A chain mutants by protease cleavage with furin (Fig. 2G and H).

These toxin variants are then coated in individual wells of 96-well plates and screened
by ELISA for their ability to bind the monoclonal antibody Onc M27 (Linsley, et al. (1988)
Monoclonal antibodies reactive with mucin glycoproteins found in sera from breast cancer
patients. Cancer Res 48, 2138-2148), directed at the well-characterized breast cancer
tripeptide epitope Thr-Arg-Pro of the human MUC1 tandem repeat (Gendler, et al. (1988) A
highly immunogenic region of a human polymorphic epithelial mucin expressed by
carcinomas is made up of tandem repeats. J Biol Chem 263, 12820-12823; Girling, et al.
(1989) A core protein epitope of the polymorphic epithelial mucin detected by the
monoclonal antibody SM-3 is selectively exposed in a range of primary carcinomas. Int J
Cancer 43, 1072-1076). Most A1 chain mutants neither code for the tripeptide insert that
matches the targeted epitope of Onc M27 nor are recognized by the antibody. However, on a
few occasions, a toxin variant harboring the exact epitope gives a strong ELISA signal
comparable to the one observed with our control A chain harboring the MUC1 tripeptide
epitope and has the expected epitope sequence in its randomized tripeptide region. These
results clearly establish that A1 chain libraries can easily be constructed and screened to find
toxin variants able to specifically target a given receptor, in this case an antigen-combining
site.

Example 3 

Proof-of-concept: Alternative method for design and mining of a prototypic A1
chain-tripeptide library

A tripeptide library inserted in the C-terminal SLT-1 A1 domain is prepared. A
simple tripeptide library is inserted in the C-terminal protease-sensitive loop of the SLT-1 A1
chain (Fig. 2G and H). This A1 domain codes for A region amino acids 1-251. The
maximal diversity of this library can thus be calculated to be 20^3 or 8000 permutations of a
tripeptide sequence. A chain libraries can easily be screened for a new receptor-binding
activity. Over 3000 colonies from this A1 chain tripeptide library are picked and the mutant
toxin produced by each clone is purified. Since all A1 subunits lack a B subunit binding area,
it is relatively easy to excise any B subunits. Optionally, the polyHis tags can be used to
further purify the A1 subunits, as described in previous examples.

These toxin variants are then coated in individual wells of 96-well plates and screened
by ELISA for their ability to bind the monoclonal antibody Onc M27 (Linsley, et al. (1988)
Monoclonal antibodies reactive with mucin glycoproteins found in sera from breast cancer
patients. Cancer Res 48, 2138-2148), directed at the well-characterized breast cancer
tripeptide epitope Thr-Arg-Pro of the human MUC1 tandem repeat (Gendler, et al. (1988) A
highly immunogenic region of a human polymorphic epithelial mucin expressed by
carcinomas is made up of tandem repeats. J Biol Chem 263, 12820-12823; Girling, et al.
(1989) A core protein epitope of the polymorphic epithelial mucin detected by the
monoclonal antibody SM-3 is selectively exposed in a range of primary carcinomas. Int J
Cancer 43, 1072-1076). Most A1 chain mutants neither code for the tripeptide insert that
matches the targeted epitope of Onc M27 nor are recognized by the antibody. However, on a
few occasions, a toxin variant harboring the exact epitope gives a strong ELISA signal
comparable to the one observed with our control A chain harboring the MUC1 tripeptide
epitope and has the expected epitope sequence in its randomized tripeptide region. These
results clearly establish that A1 chain libraries can easily be constructed and screened to find
toxin variants able to specifically target a given receptor, in this case an antigen-combining
site.

The A1 tripeptide library can also be biotinylated at the C242 position (Figure 2J). A
standard biotin assay can then be used to determine whether an A1 tripeptide library member
is present, or bound to a substrate or given receptor.

Example 4

Proof-of-concept: Method for design and mining of a prototypic A chain-tripeptide
library with a mutation in C242 and/or C261

A tripeptide library inserted in the C-terminal SLT-1 A domain is prepared, using the
method of Example 1 with the modification that the codon for the cysteine at position 242
and/or position 261 has been mutated to encode an alanine using site directed mutagenesis
(Figures 2C, 2D, or 2E). For libraries containing one cysteine in the A chain, the tripeptide
library is optionally biotinylated (Fig. 2F). The A chain libraries can easily be screened for a new
receptor-binding activity. Results indicate that these cys to ala modified A chain libraries are
at least as effective as the wild-type libraries of Example 1 in ease of construction and
screening to find toxin variants able to specifically target a given receptor. In the case of the
biotinylated libraries, the biotin labels are easily used, utilizing known methods, to identify
whether a member of the A chain library is present or bound to a sample.

Example 5 

Proof-of-concept: Method for design and mining of a prototypic A1 chain-tripeptide
library with a mutation in C242 and/or C261

A tripeptide library inserted in the C-terminal SLT-1 A1 domain is prepared, using the
method of Example 2 with the modification that the codon for the cysteine at position 242
and/or position 261 has been mutated to encode an alanine using site directed mutagenesis
(Fig. 2I). The A1 chain libraries can easily be screened for a new receptor-binding activity.
Results indicate that these cys to ala modified A1 chain libraries are as effective as the wild-type
libraries of Example 2 in ease of construction and screening to find toxin variants able to
specifically target a given receptor. Results indicate that these cys to ala modified A1 chain
libraries are significantly easier to make and purify, due to the lack of the disulfide bridge
between Cys 242 and Cys 261, resulting in a much easier cleavage of A2 from A1 in the
protease cleavage step. In the case of an A1 tripeptide library having a cysteine at position
C242, the cysteine is optionally tagged with biotin (Fig. 2J). This allows for determination of
whether the library member is present, or bound to a substrate or receptor, using known
methods.

Example 6 

Proof-of-concept: Method for design and mining of a prototypic A1 chain-tripeptide
library with a mutation in C242 and/or C261

A tripeptide library inserted in the C-terminal SLT-1 A1 domain is prepared, using the
method of Example 3 with the modification that the codon for the cysteine at position 242 has
been mutated to encode an alanine using site directed mutagenesis (Fig. 2I). The A1 chain
libraries can easily be screened for a new receptor-binding activity. Results indicate that these cys to
ala modified A1 chain libraries are as effective as the wild-type libraries of Example 4 in ease
of construction and screening to find variants able to specifically target a given receptor.

Example 7

Making of Combinatorial SLT-1 A-heptapeptide or A1-heptapeptide Library

Library diversity represents a crucial parameter in screening combinatorial libraries
for ligands able to bind specifically and with high affinity to a particular target. The SLT-1 A
chain-tripeptide library described in Examples 1 or 4, and the SLT-1 A1 chain-tripeptide
libraries described in Examples 2, 3, 5 and 6 have a maximal diversity of 20^3 or 8000
possible mutated A chains. This sampling repertoire is small but was useful in mapping a
tripeptide epitope to establish our proof-of-concept. A seven-residue library (20^7 or 1.3 x 10^9
possible mutants) represents a more typical minimal diversity level commonly used in
designing phage display as well as synthetic peptide libraries. Thus, as a starting point, we
build SLT-1 A and A1 chain libraries (as described in Examples 1-6) with a 7-amino acid
long random sequence inserted in its C-terminus (Fig. 4A-H). Each of these libraries provides
sufficient diversity to ensure that A chain or A1 chain toxin variants can be identified that
target new or known internalized receptors on cancer cells. All elements of each of these
libraries (as well as all other libraries proposed) contain an N-terminal His tag to quickly purify
A chain mutants. The libraries (Fig. 4A-4H) are generated using a megaprimer PCR strategy
(Sarkar G, and Sommers S, (1990). The 'megaprimer' method of site-directed mutagenesis.
Biotechniques 8, 404-407). The megaprimer strategy is widely used to introduce mutations
into a target DNA sequence, by way of two rounds of PCR that use two flanking primers and an
internal mutagenic primer.
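
The two-round logic of the megaprimer strategy can be sketched with plain strings. The following Python fragment is a deliberately simplified, sense-strand-only model and not a simulation of real PCR: the template is an arbitrary placeholder rather than SLT-1 DNA, and the function name, the `arm` parameter and the 9-base insert are assumptions made for illustration.

```python
def megaprimer_pcr(template: str, insert_pos: int, insert: str, arm: int = 12):
    """Toy model of two-round megaprimer mutagenesis on one strand.

    template   : DNA spanning the two flanking primer sites
    insert_pos : 0-based index in the template where new bases are inserted
    insert     : bases carried by the internal mutagenic primer
    arm        : length of template-matching sequence on the mutagenic primer
    """
    if not 0 < insert_pos <= len(template) - arm:
        raise ValueError("insertion site must leave room for the annealing arm")
    # Round 1: the upstream flanking primer and the internal mutagenic primer
    # give a short product ending with the insert plus the annealing arm.
    megaprimer = template[:insert_pos] + insert + template[insert_pos:insert_pos + arm]
    # Round 2: the purified megaprimer anneals through its template-matching
    # arm and is extended up to the downstream flanking primer site.
    full_length = megaprimer + template[insert_pos + arm:]
    return megaprimer, full_length


# Arbitrary placeholder template (13 codons); it is not derived from SLT-1.
template = "ATGGCTAGCAAAGGTGAATTCCTGACCGGTCATCACTAA"
mega, product = megaprimer_pcr(template, insert_pos=18, insert="NNSNNSNNS")
print(mega)
print(product)
```

The point of the sketch is that the first-round product already contains the new bases, so the second round only has to extend it to full length; the final product equals the template with the insert spliced in at the chosen position.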

All A chain-heptapeptide libraries and A1 chain-heptapeptide libraries are made as
follows. The SLT-1 A chain-heptapeptide libraries (Fig. 4A-4H) harbor a 7-amino acid
random insertion between amino acids 245 and 246 of the A or A1 chain, the identical site
used to construct our SLT-1 A chain-tripeptide library (Tables 1, 2, Fig. 3). Briefly,
construction of the A chain heptapeptide libraries is described as follows using as the
example an A chain containing 2 cysteines. Construction of the A1 chain heptapeptide
libraries is undertaken using similar methods, with modifications as necessary to form an A1
chain (and as described in parallel Examples 2, 3, 5 and 6, above).

Two flanking primers A (GTT ACT GTG ACA GCT GAA GCT TTA CGT TTT CG
(Seq. ID No. 2)) and B (GAG AAG AAG AGA CTG CAG ATT CCA TCT GTT G (Seq. ID
No. 3)) carrying HindIII and PstI restriction sites respectively were annealed within the 5'
and 3' ends of the SLT-1 operon. A library oligonucleotide F containing all seven random
amino acid codons (NNS) as well as a long matching sequence to anneal to the template was
synthesized. In the synthesis of the random oligonucleotide, the relative representation of
each amino acid was improved by restricting the third position of each codon to G or T
(Noren, K. A., and Noren, C. J. (2001) Construction of high-complexity combinatorial phage
display peptide libraries. Methods 23, 169-178). This type of restriction reduces the overall
DNA sequence complexity as well as coding discrepancy between residues (Reidhaar-Olson,
et al. (1991) Random mutagenesis of protein sequences using oligonucleotide cassettes.
Methods Enzymol 208, 564-586). This strategy also minimizes the occurrence of stop
codons (TAA and TGA) while the stop codon (TAG) is suppressed by using a supE bacterial
strain, which specifies the insertion of a Gln residue when TAG codons are translated.
The first PCR reaction was performed using primers A and F and the resulting product
purified by denaturing polyacrylamide gel electrophoresis. This product then served as the
megaprimer with primer B for a second PCR reaction to amplify the random DNA. The final
library DNA (PCR product) was then digested with HindIII and PstI and cloned into the
backbone of a pECHE9a expression vector (MTT, Toronto). The E. coli strain JM101 was
subsequently transformed with the resulting pECHE vector and single bacterial colonies
picked, lysed and their supernatants analyzed for the expression of single A chain toxins or
layered on cancer cells and screened using the SRB cell cytotoxicity assay.
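The effect of restricting the third codon position can be checked by enumeration. The sketch below assumes the standard genetic code and compares an unrestricted codon (NNN) with a third position limited to G or T, as described above, and to G or C, which is what the IUPAC label NNS denotes. The function name `scheme_summary` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def scheme_summary(third_bases: str) -> dict:
    """Summarize a randomization scheme N-N-(third_bases)."""
    codons = ["".join(c) for c in product(BASES, BASES, third_bases)]
    encoded = {CODON_TABLE[c] for c in codons}
    return {
        "codons": len(codons),
        "amino_acids": len(encoded - {"*"}),
        "stop_codons": sorted(c for c in codons if CODON_TABLE[c] == "*"),
    }


for label, third in [("NNN", "TCAG"), ("third = G/T", "GT"), ("third = G/C", "GC")]:
    print(label, scheme_summary(third))
```

Under either restriction the 64 codons shrink to 32, all 20 amino acids remain encoded, and TAG is the only stop codon left. This is consistent with the text's reliance on a supE strain to read TAG as Gln.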



Example 8 

Mining the combinatorial SLT-1 A-heptapeptide library against cancer cell lines using a
cytotoxicity assay

We screen our SLT-1 A-heptapeptide libraries and our SLT-1 A1-heptapeptide
libraries using the cytotoxic function of the A or A1 chain as a reporter signal. Cytotoxicity is
a more informative property to measure than binding to a receptor since it implies that the
toxin is internalized, processed and delivered near ribosomes, clearly a multi-step event. The
cytotoxicity assay is essentially performed as previously described (Bray, et al. (2001)
Probing the surface of eukaryotic cells using combinatorial toxin libraries. Current Biology
11, 697-701). Briefly, the strategy to screen all our A chain libraries is based on the
following principles. Established cancer cell lines such as SK-BR-3 (human breast),
CAMA-1 (human breast), 518A2 (human melanoma), PC3 (human prostate) and B16
(murine melanoma) are grown in 96-well plates and used as targets in the primary screen
stages. These cell lines are initially selected for our holotoxin library searches (Bray, supra)
based on their adherence (plastic), their cell viability staining properties (SRB) in a
high-throughput screening setting as well as their lack of receptor for and sensitivity to native
SLT-1 (to ensure a reduced level of false positives). Single bacterial colonies from each
library are picked and grown in 96 deep well plates. The cells are harvested, lysed, and their
lysates clarified. Since all expressed SLT-1 A chain variants or A1 chain variants have a
histidine tag at their N-terminus, each of them is purified from their lysate using
nickel-affinity beads (96-well format) and layered on target cells. The plates containing the
target cells treated with A chain variants or A1 chain variants are then incubated at 37°C for
48 hours, followed by fixation and staining with sulforhodamine B (SRB). The SRB assay is
a colorimetric end-point assay, which quantifies viable cells by staining their cellular protein
content (Skehan, et al. (1990) New colorimetric cytotoxicity assay for anticancer-drug
screening. J Natl Cancer Inst 82, 1107-1112). The SRB assay has been adopted by NCI/NIH
for their high-throughput screening of drug candidates on cancer cell lines. Viability assays
are repeated for any bacterial extracts leading to cell death. A first round of screening is
performed on more than 5000 bacterial clones (equivalent to 5000 distinct A chain toxins) for
each of the A chain and A1 chain heptapeptide libraries as described in Figures 4A to 4H, and
several toxin variants are identified in each library as repeatable killers of the human
melanoma cell line 518A2 (Fig. 5). We find that the A1 chain libraries of Figures 4F, 4G,
and 4H have cytotoxic properties similar to those of the A chain libraries. We also find that
the libraries with mutations at cysteine 242 or cysteine 261, or at both cysteines, as described
in Figures 4B, 4C, 4D, and 4E also have cytotoxic properties similar to those of the "wild type"
A chain library of Figure 4A.
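The statement that 5000 clones correspond to 5000 distinct toxins can be examined with a short calculation. The sketch below assumes, purely for illustration, that each clone carries a heptapeptide drawn uniformly at random from the 20^7 possible sequences; codon bias, stop codons and cloning artifacts are ignored. It uses the standard birthday-problem approximation.

```python
import math

# Illustrative assumption: clones are sampled uniformly from all 20^7 inserts.
LIBRARY_SIZE = 20 ** 7   # theoretical heptapeptide diversity
CLONES = 5000            # clones screened per library in the first round

# Fraction of the theoretical library covered by one screening round.
fraction_sampled = CLONES / LIBRARY_SIZE

# Birthday-problem approximation: probability that no two clones share an insert.
p_all_distinct = math.exp(-CLONES * (CLONES - 1) / (2 * LIBRARY_SIZE))

print(f"fraction of library sampled: {fraction_sampled:.2e}")
print(f"probability all clones are distinct: {p_all_distinct:.3f}")
```

Under these assumptions a round of 5000 clones samples only a few millionths of the theoretical library, and the picked clones are very likely to carry distinct inserts.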

The abscissa represents the log concentration of toxin used to treat the cells and the
ordinate depicts the observed percentage of cells that are viable after 48 hours. The closed
triangles depict the effect of the wild type toxin on 518A2 cells while the two most
efficacious A chain variants were termed SAM#3 (open squares) and SAM#5 (X symbols).

Promising A chain and A1 chain variants from each of the heptapeptide libraries are
then re-screened against a panel of cell lines (Vero [Monkey, normal kidney]; PC-3 [Human,
prostate cancer]; HepG2 [Human, hepatoma]; SiHa [Human, cervical cancer]; PanC [Human,
pancreatic cancer]; SKBR-3 [Human, breast cancer]; 518-A2 [Human, melanoma]; U87
[Human, glioma]; B16-F10 [Mouse, melanoma]; HS-216 [Human, normal fibroblast];
CAMA-1 [Human, breast cancer]; OVCar-3 [Human, ovarian cancer]). Of these variants, a
significant proportion are observed to have activity against each of the cancer cell lines, the
518-A2 human melanoma, and the SiHa (human cervical cancer cells) and U87-A (human
brain cancer cells; glioma) cell lines.

The genes coding for the two A chain toxins (SAM3 and SAM5) that resulted in
toxicity of the human melanoma cell lines were sequenced to determine the amino acid
sequences inserted between residues 245 and 246 of the wild-type A chain. The sequences,
including the His-tag, are listed in Seq. ID Nos 4 and 5, respectively.

Table 1: DNA sequences of randomly picked clones from the SLT-1 A chain-tripeptide
library. Mutated bases in bold characters. Mutant #41 was identified in our ELISA screen as
a strong binder of mAb Onc M27 (Fig. 3).



SLT-1 A chain     Nucleotide sequence              Diversity
variant                                            (nucleotide changes in
                                                   mutated region)

MUC1 epitope      CCA GAC ACG CGA CCA GCT CCA      0/9
Mutant #1         CCA GAC GGG ATC GGG GCT CCA      8/9
Mutant #2         CCA GAC CTG GAG ATG GCT CCA      8/9
Mutant #3         CCA GAC CCC CGT GOG GCT CCA      6/9
Mutant #4         CCA GAC GAC GAC TTG GCT CCA      9/9
Mutant #5         CCA GAC GTC CGG TGG GCT CCA      7/9
Mutant #6         CCA GAC CAG CGC TGG GCT CCA      6/9
Mutant #7         CCA GAC CTC AGG ATG GCT CCA      8/9
Mutant #8         CCA GAC TCC CAG GAG GCT CCA      7/9
Mutant #9         CCA GAC TCC GAC CCC GCT CCA      6/9
Mutant #41        CCA GAC ACG CGC CCC GCT CCA      2/9
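The diversity column counts how many of the nine bases in the mutated region differ from the MUC1 epitope sequence. A minimal sketch of that count for three of the clones, using the nine-base regions as listed in the table (the function name `mutated_region_changes` is illustrative):

```python
def mutated_region_changes(reference: str, variant: str) -> int:
    """Count base differences between two equal-length regions (spaces ignored)."""
    ref = reference.replace(" ", "")
    var = variant.replace(" ", "")
    if len(ref) != len(var):
        raise ValueError("regions must have the same length")
    return sum(r != v for r, v in zip(ref, var))


# Nine-base mutated region of the MUC1 epitope (codons for Thr-Arg-Pro).
MUC1_REGION = "ACG CGA CCA"

# Mutated regions of three clones, as listed in the table.
CLONES = {
    "Mutant #5": "GTC CGG TGG",
    "Mutant #9": "TCC GAC CCC",
    "Mutant #41": "ACG CGC CCC",
}

for name, region in CLONES.items():
    changes = mutated_region_changes(MUC1_REGION, region)
    print(f"{name}: {changes}/9")
```

For these three clones the counts are 7/9, 6/9 and 2/9, matching the tabulated values.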
Table 2: Amino acid sequence alignment of randomly selected clones from the SLT-1 A
chain-tripeptide library and ELISA signal of purified SLT-1 A chain variants detected with a
mAb (Onc M27) raised against the MUC1 epitope Thr-Arg-Pro. Mutated tripeptide region in
bold characters. Mutant #41 was identified in our ELISA screen as a strong binder of mAb
Onc M27.



SLT-1 A chain     Deduced amino acid sequence        ELISA readings
variant                                              (405 nm)

MUC1 epitope      CHHHPD TRP APASRVARMASDEFPSMC      1.3
Mutant #1         CHHHPD GIG APASRVARMASDEFPSMC      0.08
Mutant #2         CHHHPD LQM APASRVARMASDEFPSMC      0.03
Mutant #3         CHHHPD PRC APASRVARMASDEFPSMC      0.03
Mutant #4         CHHHPD DDL APASRVARMASDEFPSMC      0.06
Mutant #5         CHHHPD VRW APASRVARMASDEFPSMC      0.07
Mutant #6         CHHHPD QRL APASRVARMASDEFPSMC      0.06
Mutant #7         CHHHPD LRM APASRVARMASDEFPSMC      0.11
Mutant #8         CHHHPD SQE APASRVARMASDEFPSMC      0.13
Mutant #9         CHHHPD SDP APASRVARMASDEFPSMC      0.07
Mutant #41        CHHHPD TRP APASRVARMASDEFPSMC      1.25

What is claimed is:

1. A combinatorial protein library comprising a plurality of protein species, each protein
species comprising an A chain of a toxic protein into which an insert has been introduced,
wherein,

(a) the insert is a polypeptide of varying amino acid sequence having a
length of at least 2 amino acid residues; and

(b) the insert is introduced into the protease-sensitive loop of the A chain
sequence.

2. The combinatorial protein library of claim 1, wherein the library comprises at least
100 protein species.

3. The combinatorial protein library of claim 1 or 2, wherein the protein species are
formed by introducing the insert into a Shiga-like toxin 1 A chain.

4. The combinatorial protein library of claim 3, wherein the protein species is formed by
introducing the insert between amino acids 242 and 261, as defined with reference to the
sequence listed in Figure 6.

5. The combinatorial protein library of claim 4, wherein the protein species is formed by
introducing the insert between amino acids 245 and 246, as defined with reference to the
sequence listed in Figure 6.

6. The combinatorial protein library of claim 3, wherein the protein species is formed by
introducing the insert before or after amino acids 1-239 of the Shiga-like toxin 1 A chain, as
defined with reference to the sequence listed in Figure 6.

7. The combinatorial protein library of any preceding claim, wherein the insert has a
length of 7 amino acids.

8. A combinatorial expression library comprising a plurality of species of expression 
systems, each species expressing a protein species in accordance with any of claims 1-7. 

9. A mutant protein comprising an A chain of a toxic protein into which an insert has
been introduced, wherein,

(a) the insert is a polypeptide of varying amino acid sequence having a
length of at least 2 amino acid residues; and

(b) the insert is introduced into the protease-sensitive loop of the A chain
sequence.

10. The mutant protein of claim 9, wherein the A chain of a toxic protein is a Shiga-like
toxin 1 A chain.

11. The mutant protein of claim 10, wherein the insert is introduced between amino acids
242 and 261, as defined with reference to the sequence listed in Figure 6.

12. The mutant protein of claim 11, wherein the insert is introduced between amino acids
245 and 246, as defined with reference to the sequence listed in Figure 6.

13. The mutant protein of claim 12, wherein the insert comprises the sequence IYSNKLM
(Seq. ID No. 6).

14. The mutant protein of claim 12, wherein the insert comprises the sequence AAFADLI
(Seq. ID No. 7).

15. The mutant protein of claim 10, wherein the insert is introduced before or after amino
acids 1-239 of the Shiga-like toxin 1 A chain, as defined with reference to the sequence listed
in Figure 6.

16. The mutant protein of any of claims 9 to 15, wherein the insert has a length of 7
amino acids.

17. A method for identifying a ligand that binds to a specific target/receptor, comprising
the steps of:

(a) exposing cells known to possess the target/receptor to members of a
combinatorial protein library in accordance with any of claims 1 to 7;

(b) selecting members of the protein library which are observed to be toxic to the
cells;

(c) evaluating the selected members of the protein library to determine the
sequence of the inserted region, whereby a peptide of the sequence of the inserted region is
identified as a possible ligand for a target/receptor on the cell; and

(d) further testing peptides of the sequence of the inserted region to confirm that
they are a ligand for the specific target/receptor.

18. A method for isolating a toxin specific for a known target/receptor comprising the
steps of:

(a) exposing the target/receptor to a combinatorial protein library according to any
of claims 1 to 7; and

(b) isolating at least one protein species from the combinatorial protein library
captured by binding to the target/receptor.

19. The method of claim 18, further comprising the step of screening the isolated protein
against cells expressing the target/receptor, to confirm their toxicity for cells expressing the
target/receptor.

20. The method of claim 18, wherein the target/receptor is a purified target/receptor and is
immobilized on a solid support.

21. The method of claim 20, wherein the target/receptor is on the surface of cells.

22. The method of claim 21, wherein the cells are immobilized on a solid support. 

23. The method of claim 21 or 22, wherein the toxin serves as a reporter, and the death of 
the cells is indicative of receptor binding. 

24. A combinatorial protein library comprising a plurality of protein species, each protein
species comprising a modified A chain of a toxic protein into which an insert has been
introduced, wherein,

(a) the modified A chain comprises a wild-type A chain containing a
mutation in at least one cysteine;

(b) the insert is a polypeptide of varying amino acid sequence having a length
of at least 2 amino acid residues; and

(c) the insert is introduced into the region which would be a protease-sensitive
loop in the wild-type A chain sequence.

25. The combinatorial protein library of claim 24, wherein the library comprises at least
100 protein species.

26. The combinatorial protein library of claim 24, wherein the protein species are formed
by introducing the insert into a Shiga-like toxin 1 A chain containing a mutation in at least one
cysteine.

27. The combinatorial protein library of claim 24, 25 or 26, wherein the mutation in at
least one cysteine is a mutation from cysteine to an alanine.

28. The combinatorial protein library of claim 25, wherein the protein species is formed
by introducing the insert between amino acids 242 and 261, as defined with reference to the
sequence listed in figures 14A, 14C, or 15A.

29. The combinatorial protein library of claim 26, wherein the protein species is formed
by introducing the insert between amino acids 245 and 246, as defined with reference to the
sequence listed in figures 14A, 14C or 15A.

30. The combinatorial protein library of claim 26, wherein the protein species is formed
by introducing the insert before or after amino acids 1-239 of the Shiga-like toxin 1 A chain,
as defined with reference to the sequence listed in figures 14A, 14C or 15A.

31. The combinatorial protein library of any of claims 24-30, wherein the insert has a
length of 7 amino acids.

32. A combinatorial expression library comprising a plurality of species of expression
systems, each species expressing a protein species in accordance with any of claims 24-31.

33. The combinatorial expression library of any one of claims 24 to 32, wherein the
modified A chain comprises one cysteine, further comprising a label bound to said cysteine.

34. The combinatorial expression library of claim 33 wherein the label is a fluorescent
label.

35. The combinatorial expression library of claim 33 wherein the label is biotinylation or
pegylation.

36. The combinatorial expression library of claim 33 wherein the label is a radioactive
label.

37. A mutant protein comprising a modified A chain of a toxic protein into which an
insert has been introduced, wherein,

(a) the modified A chain comprises a wild-type A chain containing a
mutation in at least one cysteine;

(b) the insert is a polypeptide of varying amino acid sequence having a
length of at least 2 amino acid residues; and

(c) the insert is introduced into a region which would be a protease-sensitive
loop in the wild-type A chain sequence.

38. The mutant protein of claim 37, wherein the mutation in at least one cysteine is a
mutation from cysteine to an alanine.

39. The mutant protein of claim 37, wherein the modified A chain comprises one
cysteine, further comprising a label bound to said cysteine.

40. The mutant protein of claim 39 wherein the label is a fluorescent label.

41. The mutant protein of claim 39 wherein the label is biotinylation or pegylation.

42. The mutant protein of claim 39 wherein the label is a radioactive label.

43. The mutant protein of claim 37, wherein the wild-type A chain of a toxic protein is a
Shiga-like toxin 1 A chain.

44. The mutant protein of claim 37, wherein the insert is introduced between amino acids
242 and 261, as defined with reference to the sequences listed in figures 14A, 14C or 15A.

45. The mutant protein of claim 37, wherein the insert is introduced between amino acids
245 and 246, as defined with reference to the sequences listed in figures 14A, 14C or 15A.

46. The mutant protein of claim 45, wherein the insert comprises the sequence IYSNKLM
(Seq. ID No. 6).

47. The mutant protein of claim 45, wherein the insert comprises the sequence AAFADLI
(Seq. ID No. 7).

48. The mutant protein of claim 37, wherein the insert is introduced before or after amino
acids 1-239 of a modified Shiga-like toxin 1 A chain, as defined with reference to the
sequences listed in figures 14A, 14C or 15A.

49. The mutant protein of any of claims 37 to 48, wherein the insert has a length of 7
amino acids.

50. A polypeptide sequence selected from the group consisting of the polypeptide
sequence listed in figures 11C, 11D, 12A, 12B, 12C, or 12D.

51. A method for identifying a ligand that binds to a specific target/receptor, comprising
the steps of:

(a) exposing cells known to possess the target/receptor to members of a
combinatorial protein library in accordance with any of claims 24 to 36;

(b) selecting members of the protein library which are observed to be toxic to the
cells;

(c) evaluating the selected members of the protein library to determine the
sequence of the inserted region, whereby a peptide of the sequence of the inserted region is
identified as a possible ligand for a target/receptor on the cell; and

(d) further testing peptides of the sequence of the inserted region to confirm that
they are a ligand for the specific target/receptor.

52. A method for isolating a toxin specific for a known target/receptor comprising the
steps of:

(a) exposing the target/receptor to a combinatorial protein library according to any
of claims 24 to 36; and

(b) isolating at least one protein species from the combinatorial protein library
captured by binding to the target/receptor.

53. The method of claim 52, further comprising the step of screening the isolated protein
against cells expressing the target/receptor, to confirm their toxicity for cells expressing the
target/receptor.

54. The method of claim 52, wherein the target/receptor is a purified target/receptor and is
immobilized on a solid support.

55. The method of claim 54, wherein the target/receptor is on the surface of cells.

56. The method of claim 55, wherein the cells are immobilized on a solid support.

57. The method of claim 55 or 56, wherein the toxin serves as a reporter, and the death of
the cells is indicative of receptor binding.

58. A combinatorial protein library comprising a plurality of protein species, each protein
species comprising an A1 chain of a toxic protein into which an insert has been introduced,
wherein,

(a) the insert is a polypeptide of varying amino acid sequence having a
length of at least 2 amino acid residues; and

(b) the insert is introduced into the portion of the A1 chain that would be a
protease-sensitive loop if the A1 chain was a full-length A chain.

59. The combinatorial protein library of claim 58, wherein the library comprises at least
100 protein species.

60. The combinatorial protein library of claim 58 or 59, wherein the protein species are
formed by introducing the insert into a Shiga-like toxin 1 A1 chain.

61. The combinatorial protein library of claim 60, wherein the protein species is formed
by introducing the insert between amino acids 242 and 251, as defined with reference to
figure 15C.

62. The combinatorial protein library of claim 61, wherein the protein species is formed
by introducing the insert between amino acids 245 and 246, as defined with reference to
figure 15C.

63. The combinatorial protein library of claim 60, wherein the protein species is formed
by introducing the insert before or after amino acids 1-239 of the Shiga-like toxin 1 A1 chain,
as defined with reference to figure 15C.

64. The combinatorial protein library of any one of claims 58-63, wherein the insert has a
length of 7 amino acids.

65. A combinatorial expression library comprising a plurality of species of expression
systems, each species expressing a protein species in accordance with any of claims 58-64.

66. A combinatorial expression library of any one of claims 58-65 further comprising a
label bound to a cysteine in said A1 chain.

67. The combinatorial expression library of claim 66 wherein the label is a fluorescent
label.

68. The combinatorial expression library of claim 66 wherein the label is biotinylation or
pegylation.

69. The combinatorial expression library of claim 66 wherein the label is a radioactive
label.

70. A mutant protein comprising an A1 chain of a toxic protein into which an insert has
been introduced, wherein,

(a) the insert is a polypeptide of varying amino acid sequence having a
length of at least 2 amino acid residues; and

(b) the insert is introduced into the portion of the A1 chain that would be a
protease-sensitive loop if the A1 chain was a full-length A chain.

71. The mutant protein of claim 70 further comprising a label bound to a cysteine in said
A1 chain.

72. The mutant protein of claim 71 wherein the label is a fluorescent label.

73. The mutant protein of claim 71 wherein the label is biotinylation or pegylation.

74. The mutant protein of claim 71 wherein the label is a radioactive label.

75. The mutant protein of claim 70, wherein the A1 chain of a toxic protein is a Shiga-like
toxin 1 A1 chain.

76. The mutant protein of claim 75, wherein the insert is introduced between amino acids
242 and 251, as defined with reference to figure 15C.

77. The mutant protein of claim 76, wherein the insert is introduced between amino acids
245 and 246, as defined with reference to figure 15C.

78. The mutant protein of claim 77, wherein the insert comprises the sequence IYSNKLM
(Seq. ID No. 6).

79. The mutant protein of claim 77, wherein the insert comprises the sequence AAFADLI
(Seq. ID No. 7).

80. The mutant protein of claim 75, wherein the insert is introduced before or after amino
acids 1-239 of the Shiga-like toxin 1 A1 chain, as defined with reference to figure 15C.

81. The mutant protein of any of claims 70 to 80, wherein the insert has a length of 7
amino acids.

82. A method for identifying a ligand that binds to a specific target/receptor, comprising
the steps of:

(a) exposing cells known to possess the target/receptor to members of a
combinatorial protein library in accordance with any of claims 58 to 69;

(b) selecting members of the protein library which are observed to be toxic to the
cells;

(c) evaluating the selected members of the protein library to determine the
sequence of the inserted region, whereby a peptide of the sequence of the inserted region is
identified as a possible ligand for a target/receptor on the cell; and

(d) further testing peptides of the sequence of the inserted region to confirm that
they are a ligand for the specific target/receptor.

83. A method for isolating a toxin specific for a known target/receptor comprising the
steps of:

(a) exposing the target/receptor to a combinatorial protein library according to any
of claims 58 to 69; and

(b) isolating at least one protein species from the combinatorial protein library
captured by binding to the target/receptor.

84. The method of claim 83, further comprising the step of screening the isolated protein
against cells expressing the target/receptor, to confirm their toxicity for cells expressing the
target/receptor.

85. The method of claim 83, wherein the target/receptor is a purified target/receptor and is
immobilized on a solid support.

86. The method of claim 85, wherein the target/receptor is on the surface of cells.

87. The method of claim 86, wherein the cells are immobilized on a solid support.

88. The method of claim 86 or 87, wherein the toxin serves as a reporter, and the death of
the cells is indicative of receptor binding.

89. A combinatorial protein library comprising a plurality of protein species, each protein
species comprising a modified A1 chain of a toxic protein into which an insert has been
introduced, wherein,

(a) the modified A1 chain comprises a wild-type A1 chain containing a
mutation in one cysteine;

(b) the insert is a polypeptide of varying amino acid sequence having a length
of at least 2 amino acid residues; and

(c) the insert is introduced into the portion of the A1 chain that would be a
protease-sensitive loop if the A1 chain was a full-length A chain.

90. The combinatorial protein library of claim 89, wherein the library comprises at least
100 protein species.

91. The combinatorial protein library of claim 89 or 90, wherein the protein species are
formed by introducing the insert into a Shiga-like toxin 1 A1 chain containing a mutation in
one cysteine.

92. The combinatorial protein library of any one of claims 89 to 91, wherein the mutation
in one cysteine is that the cysteine is modified to an alanine.

93. The combinatorial protein library of claim 91, wherein the protein species is formed
by introducing the insert between amino acids 242 and 261, as defined with reference to
figure 16A.

94. The combinatorial protein library of claim 93, wherein the protein species is formed
by introducing the insert between amino acids 245 and 246, as defined with reference to
figure 16A.

95. The combinatorial protein library of claim 91, wherein the protein species is formed
by introducing the insert before or after amino acids 1-239 of the Shiga-like toxin 1 A1 chain,
as defined with reference to figure 16A.

96. The combinatorial protein library of any of claims 89-95, wherein the insert has a
length of 7 amino acids.

97. A combinatorial expression library comprising a plurality of species of expression
systems, each species expressing a protein species in accordance with any of claims 89-96.

98. A mutant protein comprising a modified A1 chain of a toxic protein into which an
insert has been introduced, wherein,

(a) the modified A1 chain comprises a wild-type A1 chain containing a
mutation in one cysteine;

(b) the insert is a polypeptide of varying amino acid sequence having a
length of at least 2 amino acid residues; and

(c) the insert is introduced into the portion of the A1 chain that would be a
protease-sensitive loop if the A1 chain was a full-length A chain.

99. The mutant protein of claim 98, wherein the wild-type A1 chain of a toxic protein is a
Shiga-like toxin 1 A1 chain.

100. The mutant protein of claim 99, wherein the insert is introduced between amino acids
242 and 261, as defined with reference to figure 16A.

101. The mutant protein of claim 100, wherein the insert is introduced between amino
acids 245 and 246, as defined with reference to figure 15A.

102. The mutant protein of claim 101, wherein the insert comprises the sequence
IYSNKLM (Seq. ID No. 6).

103. The mutant protein of claim 101, wherein the insert comprises the sequence
AAFADLI (Seq. ID No. 7).

104. The mutant protein of claim 99, wherein the insert is introduced before or after amino
acids 1-239 of a modified Shiga-like toxin 1 A1 chain, as defined with reference to figure
16A.

105. The mutant protein of any of claims 98 to 104, wherein the insert has a length of 7
amino acids.

106. A polypeptide sequence selected from the group consisting of the polypeptide
sequence of figures 13A, 13B, 13C, 13D, 16A, or 16B.

107. A method for identifying a ligand that binds to a specific target/receptor, comprising
the steps of:

(a) exposing cells known to possess the target/receptor to members of a
combinatorial protein library in accordance with any of claims 89 to 97;

(b) selecting members of the protein library which are observed to be toxic to the
cells;

(c) evaluating the selected members of the protein library to determine the
sequence of the inserted region, whereby a peptide of the sequence of the inserted region is
identified as a possible ligand for a target/receptor on the cell; and

(d) further testing peptides of the sequence of the inserted region to confirm that
they are a ligand for the specific target/receptor.

108. A method for isolating a toxin specific for a known target/receptor comprising the
steps of:

(a) exposing the target/receptor to a combinatorial protein library according to any
of claims 89 to 97; and

(b) isolating at least one protein species from the combinatorial protein library
captured by binding to the target/receptor.

109. The method of claim 108, further comprising the step of screening the isolated protein
against cells expressing the target/receptor, to confirm their toxicity for cells expressing the
target/receptor.

110. The method of claim 108, wherein the target/receptor is a purified target/receptor and
is immobilized on a solid support.

111. The method of claim 108, wherein the target/receptor is on the surface of cells.

112. The method of claim 111, wherein the cells are immobilized on a solid support.

113. The method of claim 1 U or 112, wherein the toxin serves as a reporter, and the death 
of the ceils is indicative of receptor binding. 
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114. The combinatorial proteiii library of any one of claims 1 -8 wherein the insert is 
introduced at the N-terminus of the A chain sequence instead of in the protease seasitive loop 
region. 

115. The mutant protein of any one of claims 9-16 wherein the insert is introduced at the
N-terminus of the A chain sequence instead of in the protease sensitive loop region.

116. The combinatorial protein library of any one of claims 24-36 wherein the insert is
introduced at the N-terminus of the modified A chain instead of in the region that would be a
protease sensitive loop in the wild-type A chain sequence.

117. The mutant protein of any one of claims 37-49 wherein the insert is introduced at the
N-terminus of the modified A chain instead of in the region that would be a protease sensitive
loop in the wild-type A chain sequence.

118. The combinatorial protein library of any one of claims 58-69 or 89-97 wherein the insert
is introduced at the N-terminus of the A1 chain instead of into the portion of the A1 chain
that would be a protease-sensitive loop if the A1 chain was a full length A chain.

119. The mutant protein of any one of claims 70-81 or 98-105 wherein the insert is
introduced at the N-terminus of the A1 chain instead of into the portion of the A1 chain that
would be a protease-sensitive loop if the A1 chain was a full length A chain.

120. A method for identifying a ligand that binds to a specific target/receptor, comprising
the steps of:

(a) exposing cells known to possess the target/receptor to members of a
combinatorial protein library in accordance with any of claims 114, 116, or 118;

(b) selecting members of the protein library which are observed to be toxic to the
cells;

(c) evaluating the selected members of the protein library to determine the
sequence of the inserted region, whereby a peptide of the sequence of the inserted region is
identified as a possible ligand for a target/receptor on the cell; and

(d) further testing peptides of the sequence of the inserted region to confirm that
they are a ligand for the specific target/receptor. 

121. A method for isolating a toxin specific for a known target/receptor comprising the
steps of: 

(a) exposing the target/receptor to a combinatorial protein library according to any
of claims 114, 116 or 118; and

(b) isolating at least one protein species from the combinatorial protein library
captured by binding to the target/receptor.

122. The method of claim 121, further comprising the step of screening the isolated protein
against cells expressing the target/receptor, to confirm their toxicity for cells expressing the
target/receptor.

123. The method of claim 121, wherein the target/receptor is a purified target/receptor and
is immobilized on a solid support.

124. The method of claim 121, wherein the target/receptor is on the surface of cells.

125. The method of claim 124, wherein the cells are immobilized on a solid support.

126. The method of claim 124 or 125, wherein the toxin serves as a reporter, and the death
of the cells is indicative of receptor binding.

127. The method of any one of claims 1 to 23 or 82 to 88, wherein the library is
biotinylated.

128. The library of any one of claims 24 to 36 or 89 to 97, wherein the library is
biotinylated.

Fig. 2D
Fig. 2G
Fig. 2J
Representative ELISA data set from screening 96 distinct toxin variants
Fig. 4A
Fig. 4D
Fig. 4G
Fig. 5

Figure 6



KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS 
Figure 7

7A:

Sequence for Figure 2A minus HIS-Tag, protease sensitive
region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHPDTRPAPASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS

7B:

Sequence for Figure 2A including HIS-Tag, protease sensitive
region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNCHHHPDTRPAPASRVARMASDEFPSMCPADGRVRGITHNK
ILWDSSTLGAILMRRTISS 

7C:

Sequence for Figure 2B minus HIS-Tag, Protease Sensitive
Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHPDXXXAPASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS

7D:

Sequence for Figure 2B including HIS-Tag, Protease Sensitive
Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNCHHHPDXXXAPASRVARMASDEFPSMCPADGRVRGITHNK 
ILWDSSTLGAILMRRTISS 
Figure 8

8A:

Sequence for Figures 2C & 2F minus HIS-Tag, Protease
Sensitive Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHPDXXXAPASRVARMASDEFPSMAPADGRVRGITHNKILWDSSTLGAILMRRTISS



8B: 

Sequence for Figures 2C & 2F including HIS-Tag, Protease
Sensitive Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF 
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV 
GRISFGSINAILGSVALILNCHHHPDXXXAPASRVARMASDEFPSMAPADGRVRGITHNK 
ILWDSSTLGAILMRRTISS



8C: 

Sequence for Figure 2D minus HIS-Tag, Protease Sensitive
Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NAHHHPDXXXAPASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS



8D:

Sequence for Figure 2D including HIS-Tag, Protease Sensitive
Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHPDXXXAPASRVARMASDEFPSMCPADGRVRGITHNK
ILWDSSTLGAILMRRTISS 
Figure 9

9A:

Sequence for Figure 2E minus HIS-Tag, Protease Sensitive
Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NAHHHPDXXXAPASRVARMASDEFPSMAPADGRVRGITHNKILWDSSTLGAILMRRTISS 

9B:

Sequence for Figure 2E including HIS-Tag, Protease Sensitive 
Region 

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHPDXXXAPASRVARMASDEFPSMAPADGRVRGITHNK
ILWDSSTLGAILMRRTISS

9C:

Sequence for Figure 2G minus HIS-Tag, Protease Sensitive
Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHPDTRPAPASRVAR 

9D:

Sequence for Figure 2G including HIS-Tag, Protease Sensitive
Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNCHHHPDTRPAPASRVAR



Figure 10 
10A:

Sequence for Figures 2H & 2J minus HIS-Tag, Protease
Sensitive Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHPDXXXAPASRVAR

10B:

Sequence for Figures 2H & 2J including HIS-Tag, Protease
Sensitive Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNCHHHPDXXXAPASRVAR

10C:

Sequence for Figure 2I minus HIS-Tag, Protease Sensitive
Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL 
NAHHHPDXXXAPASRVAR 

10D:

Sequence for Figure 2I including HIS-Tag, Protease Sensitive
Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHPDXXXAPASRVAR
Figure 11

11A

Sequence for Figure 4A minus HIS-Tag, Protease Sensitive
Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHXXXXXXXASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS

11B

Sequence for Figure 4A including HIS-Tag, Protease Sensitive
Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNCHHHXXXXXXXASRVARMASDEFPSMCPADGRVRGITHNK
ILWDSSTLGAILMRRTISS 

11C

Sequence for Figures 4B & 4E minus HIS-Tag, Protease
Sensitive Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHXXXXXXXASRVARMASDEFPSMAPADGRVRGITHNKILWDSSTLGAILMRRTISS

11D

Sequence for Figures 4B & 4E including HIS-Tag, Protease
Sensitive Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNCHHHXXXXXXXASRVARMASDEFPSMAPADGRVRGITHNK
ILWDSSTLGAILMRRTISS 



Figure 12 
12A

Sequence for Figure 4C minus HIS-Tag, Protease Sensitive
Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NAHHHXXXXXXXASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS 

12B 

Sequence for Figure 4C including HIS-Tag, Protease Sensitive
Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHXXXXXXXASRVARMASDEFPSMCPADGRVRGITHNK
ILWDSSTLGAILMRRTISS 



12C 

Sequence for Figure 4D minus HIS-Tag, Protease Sensitive 
Region 

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NAHHHXXXXXXXASRVARMASDEFPSMAPADGRVRGITHNKILWDSSTLGAILMRRTISS 

12D

Sequence for Figure 4D including HIS-Tag, Protease Sensitive
Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHXXXXXXXASRVARMASDEFPSMAPADGRVRGITHNK
ILWDSSTLGAILMRRTISS



Figure 13 

13A 

Sequence for Figures 4F & 4G minus HIS-Tag, Protease
Sensitive Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHXXXXXXXASRVAR 

13B 

Sequence for Figures 4F & 4G including HIS-Tag, Protease
Sensitive Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF 
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV 
GRISFGSINAILGSVALILNCHHHXXXXXXXASRVAR 

13C 

Sequence for Figure 4H minus HIS-Tag, Protease Sensitive
Region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NAHHHXXXXXXXASRVAR

13D 

Sequence for Figure 4H including HIS-Tag, Protease Sensitive 
Region 

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHXXXXXXXASRVAR
Figure 14

14A

Sequence for wild-type A region with C261A

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHASRVARMASDEFPSMAPADGRVRGITHNKILWDSSTLGAILMRRTISS

14B 

Sequence for A region with C261A and including HIS-Tag, 
Protease Sensitive Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNCHHHASRVARMASDEFPSMAPADGRVRGITHNKILWDSST 
LGAILMRRTISS 

14C 

Sequence for wild-type A region with C242A

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NAHHHASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS

14D 

Sequence for A region with C242A and including HIS-Tag,
Protease Sensitive Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHASRVARMASDEFPSMCPADGRVRGITHNKILWDSST 
LGAILMRRTISS 
Figure 15

15A

Sequence for A region with C242A, C261A

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NAHHHASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS

15B 

Sequence for A region with C242A, C261A, and including HIS-Tag,
Protease Sensitive Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHASRVARMASDEFPSMCPADGRVRGITHNKILWDSST 
LGAILMRRTISS 

15C 

Sequence for A1 region

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NCHHHASRVAR 



15D 

Sequence for A1 region including HIS-Tag, Protease Sensitive
Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNCHHHASRVAR



Figure 16 
16A 

Sequence for A1 region with C242A

KEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGSGDNLFAVDVRGIDPE
EGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRV
AGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRT
TLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRVGRISFGSINAILGSVALIL
NAHHHASRVAR



16B

Sequence for A1 region with C242A and including HIS-Tag,
Protease Sensitive Region

KGMRSHHHHHHHHIEGRASKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMI
DSGSGDNLFAVDVRGIDPEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFP
GTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTSYLDLMSHSGTSLTQSVARAMLRF
VTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDSVRV
GRISFGSINAILGSVALILNAHHHASRVAR